TANGLE SOLUTIONS FOR COMPOSITE KNOTS: APPLICATION TO HIN 

RECOMBINATION 

DOROTHY BUCK AND MAURO MAURICIO 

Abstract. We extend the tangle model, originally developed by Ernst and Sumners [18], to include com- 
posite knots. We show that, for any prime tangle, there are no rational tangle attachments of distance 
greater than one that first yield a 4-plat and then a connected sum of 4-plats. This is done by building on 
_ results on exceptional Dehn fillings at maximal distance. We then apply our results to the action of the 

N»«^ Hin recombinase on mutated sites. In particular, after solving the tangle equations for processive recombi- 

^"^ nation, we use our work to give a complete set of solutions to the tangle equations modelling distributive 

s««3 recombination. 
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1. Introduction 



^H The tangle model was introduced by Ernst and Sumners [18] as a tool for understanding the action 

of certain proteins, site-specific recombinases, on DNA. If the initial DNA molecule is circular, this 

reaction can form DNA knots and links. In this setting, the circular DNA molecules are decomposed into a 

collection of tangles, and the protein action is modelled as replacing one particular tangle P with another, R. 

Cd Information on the knot or link type of the resulting DNA can then be used in conjunction with the tangle 

H model to help determine both structural and mechanistic information about site-specific recombination. This 

'— ' approach has been carried out successfully in many situations ([5, 6, 13, 15, 18-20, 40, 41]). 

, These treatments have been restricted to the case where the DNA products are exclusively prime knots 

^ and links. However, experiments show that, in certain situations, it is possible to obtain products which 

00 are nontrivial connected sums [27]. The previous analysis relies on the Cyclic Surgery Theorem [14]. This 

■^^ approach breaks down for composite knots, essentially because their double branched cover do not have cyclic 

^j^ group. In this paper, we extend the tangle model to be able to deal with these non-prime cases. In particular, 

we prove that if the distance between the parental tangle P and the recombinant tangle R is greater or equal 



r ■ than 2, then only the obvious locally knotted solutions may occur. We also give some restrictions on the 

3^ solutions for the much more difficult distance one case. In particular, we completely describe the situation 

I for Montesinos tangles. We illustrate this by applying the results to the Hin recombinase, a protein present 

• • in the bacteria Salmonella enterica. 



L> This work is organised as follows: in Section 2, we give an introduction to site-specific recombination, 

%^ followed by a description of the Hin recombinase system, which acts as a motivation for the work to follow. 

C^ In Section 3, we give a brief review of tangles, which are the natural objects of study in topological models 

of site-specific recombination. Section 4 contains the bulk of our theoretical work: we formulate a tangle 
model for the occurrence of composite products, and we relate it to the problem of exceptional Dehn fillings 
on simple manifolds at maximal distance. Here, we show that there are no rational tangle attachments to 
a prime tangle yielding a 4-plat and then a connected sum of 4-plats. In Section 5, we solve the tangle 
equations arising from the processive Hin recombination on mutated sites. In Section 6, we combine this 
with our results from Section 4 to obtain the complete set of tangle solutions to the equations arising from 
distributive Hin recombination (Theorem 6.2). We conclude in Section 7 by discussing future directions for 
research. 

2. Biological Motivation 

2.1. Site-specific Recombination. Site-specific recombination is a fairly broad term that is used to de- 
scribe a localised protein reaction in which two double stranded DNA segments (possibly belonging to the 
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same DNA molecule) are brought together, cut, rearranged and resealed in some particular manner. This 
results in a variety of local operations on the genome of organisms, such as insertion of segments, a key step 
in viral infections, or inversion of a particular sequence, which can be a way to regulate gene expression (see 
[25] for more details). 

This process can be described in more detail as follows: in a first step, recombinases detect each individual 
specific DNA sequence and bind to it. Typically, four protein molecules bind to two sites (two molecules for 
each segment). The two segments are then brought together, either by protein action or random thermal 
motion, and juxtaposed. The juxtaposed sites are known as the crossover sites. The trapped DNA together 
with the protein is known as the synaptosome. In a second stage of the reaction, DNA strands are cleaved, 
exchanged and resealed within the synaptosome complex. A total of four such reactions occur, one for each 
strand, effectively disconnecting each crossover site. A final stage of the reaction involves the breaking down 
of the synaptosome, and the disassociation of the protein from the DNA. Before this occurs, the protein 
can immediately cleave again and perform one more round of recombination. This phenomenon is known 
as processive recombination. Naturally, proteins may act on DNA which has already undergone one or 
more full recombination events, and this is known as distributive recombination. 

2.2. The Hin recombinase. Our work will focus chiefly on the particular case of Hin-mediated recombi- 
nation, which we describe in more detail. Hin is a site-specific recombinase from the serine family. It is 
found in vivo in members of the Salmonella family of bacteria, for example Salmonella enterica. Its primary 
function is to regulate the expression of the fliC and fliB genes. These genes are involved in the synthesis of 
the flagellum of the bacteria. The flagellum is a tail-like structure present in some bacteria, and it is their 
primary source of locomotion through media. A primary constituent of the flagellum is the protein known 
as flagellin. This helps the flagellum achieve a helicoidal structure, which favours locomotion. It is precisely 
the synthesis of this protein that fliC and fliB regulate. 

Depending on the state of activity of these genes, slightly different versions of flagellin will be assembled. 
This phenomenon is known as flagellar phase variation. Recombination by Hin inverts a particular 
DNA sequence, thereby acting as an "on-off" switch for gene expression. It acts on a so-called H segment, 
containing the regions that code for fliB and fliC. It is known that, when H is in standard position, expression 
of fliC is inhibited, resulting in one type of flagellar phase, whereas when H is in inverted position, fliB is 
inhibited and fliC is expressed, resulting in the alternative flagellar variety. This is described in more detail 
by Kutsukake et al in [31]. 

if is a 996 base pair (bp) sequence, with two specific sites, each of length 26bp, at its extremes. These sites 
are designated hixL and hixR, and it is known that these are the specific sites that Hin binds to. Figure 1 
illustrates the known bp sequence for hixL. The two letter subsequence AA (or TT, depending on the strand) 
in bold is the region where DNA cleavage takes place. Cleavage occurs along the region specified by the gray 
line. The hixR site has a slightly different sequence, but it also contains the cleavage sites highlighted. 




HixR 



TTCTTGAAAACdAAGG I I I I I GATAA 
AAGAACTTTTGGTTpCAAAAACTATT 

Figure 1. a) Schematic of the H sequence, together with the hixL and hixR sequence. The 
hixL sequence, b) The hixL sequence spelt in detail, together with the cleavage sites in gray. 
Adapted from [25]. 

The Hin mediated inversion reaction is catalysed by enhancer proteins HU and Fis. In the H segment, 
there is also a 65bp long enhancer sequence. During the inversion reaction, two pairs of molecules of Fis 
bind to the enhancer sequence to help form the synaptic complex: see Figure 3 (the enhancer sequence is 



unaffected during subsequent stages of recombination). The synaptosome consists then of four subunits of 
Hin, bound in pairs to each of the recombination sites. Following the terminology of [29], we will henceforth 
name it the invertasome. 




Figure 2. Proposed model for the configuration of the invertasome. The white balls are 
the four subunits of Hin, whereas the solid black balls are units of Fis bound to the enhancer 
sequence (depicted by the white rectangle). 



2.3. Hin-mediated recombination yields composite DNA knots. In [27], Heichman et al performed a 
series of experiments with Hin in vitro, on both "standard" and mutated circular DNA molecules. To create 
the mutants, Heichman and colleagues introduced a small mutation at the cleavage site of hixL, replacing 
AA by AT. These molecules were then recombincd by Hin, and the resulting knot types were analysed both 
by agarose gel techniques and by electron microscopy^. The electron microscopy results for recombination on 
mutated sites are displayed in Figure 3. Therefore, we have a sequence of reaction products as Oi — >■ 3i — >■ 62. 
There is also the occurrence of a non-prime knot 3i^3i. The agarose gels also reveal the formation of a 7 
crossing knot, the knot type of which was not imaged via electron microscopy. 




Figure 3. Rec-A enhanced electron microscopy images of the products of the pRJ862 
reaction. One sees a trefoil knot, the 62 twist knot, and a composite knot 3i#3i. Taken 
from [27] 

The spectrum of knots observed led Heichman et al to propose the following strand exchange mechanism 
for the action of Hin: cleavage occurs at the sites, in the standard manner for a serine recombinase, and 
then a 180° clockwise turn is performed at each recombination round, as illustrated in Figure 4: this is in 
accordance with the products that were observed. With this scheme, it is also easy to explain the spectrum 
of products observed for the action on mutant sites, namely the absence of figure-eight knots and 6 crossing 
twist knots: upon performing the first round of rotation, Hin tries to reseal the strands, but it finds that 
there is a mismatch in the sequence (i.e. the strands are not complementary). This causes the system to 
undergo an additional round of recombination in order to achieve a configuration that is consistent with base 
pair complementarity: see Figure 5. 

As for the step that produces 3i#3i and other non-prime knots, this cannot come from processive re- 
combination, since this only produces twist knots, which are prime. The most likely explanation is that this 
product arises when Hin performs distributive recombination. It is conceivable that Hin could perform one 
round of recombination, resulting in 3i, before releasing DNA. It could then bind once more to form a new 
invertasome, keeping the knotted arc in the outside unbound DNA. After one round of recombination, this 
would produce 3i#3i. 

We are thus motivated to apply the tangle model of Ernst and Sumners to give mathematical confirmation 
of these predictions. Our main goals are the following: 



It was also observed that the standard type recombination yields a sequence of products of the form Oi — > 3i — > 4i — > 52- 
The tangle analysis of this system is essentially carried out in Vazquez and Sumners [40], and we refer the reader to it for 
details. 
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Figure 4. The model proposed in [27] predicts a family of products in accordance with the 
observed data. 




Figure 5. Mismatch in the DNA sequence after one attempted round of recombination. 
Taken from [27] 

(1) To describe the formation of the knots 3i, 62 and the 7-crossing knot, as far as the topological 
mechanism of strand exchange is concerned. 

(2) To describe, in terms of an appropriate tangle model, the formation of composite knots 3i#3i as 
reaction products. 

The first item will be in the same spirit of the aforementioned classical tangle model. The second step, 
however, will require a slightly new approach to the tangle model, not so much in formulation, but in the 
techniques required to solve it. We postpone the details of this until Section 4, after we have defined the 
mathematical objects under scrutiny, the tangles. 

3. Background on tangles 

Tangles were first introduced by Conway [11], as part of a scheme to develop a more efficient notation 
for enumerating knots. Since then, quite a lot of work has been done on them, and they are a rich source 
of examples in Dehn surgery, for they provide a fairly friendly way to visualise the properties of some 
3-manifolds (for a comprehensive discussion of this, see Gordon [22]). We begin by making the definition: 

Definition 3.1. A tangle is a topological triple {B, t, ip), where B is (PL or smoothly) homeomorphic to a 3- 
ball, t is a pair of arcs properly embedded in B , and ip is an orientation-preserving embedding -0 : (dB, dt) — > 
{8"^,?), where {S'^ , P) is the unit 2-sphere in Euclidean space, centred at the origin, together with a choice 
of 4 distinguished points P = {NW, SW, NE, SE}. We call ip the boundary parametrisation of (B, t). 

We will commonly refer to t as the arcs of the tangle. In situations where the boundary parametrisation 
is irrelevant, we will simply use the notation {B,t). We will consider two notions of equivalence. The first 
one, which ignores ip, will be given the name of tangle homcomorphism, whereas the second (a stronger 
notion of equivalence) will receive the name of tangle isomorphism. 
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Definition 3.2. Two tangles (_Bi,ti) and (i?2j^2) are said to be homeomorphic if there exists a honieo- 
morphism h : (Bi,ti) — > (i?2j^2) such that h{ti) = ^2- 

Definition 3.3. Two tangles (i?i, ti,i/'i) o,nd {B2,t2,ip2) ore said to be isomorphic if there exists a home- 
omorphism h : {Bi,ti) — >■ (52:^2) o,s above with the additional requirement that the following diagram 
commutes (Id is understood to be the identity map). 



Bi - 


-^ B2 







Tangles are partitioned into 3 equivalence classes, up to honieomorphism: 

Definition 3.4. Up to tangle honieomorphism, we have the following: 

(1) A tangle {B,t) is said to be locally knotted if 3 an embedded 2-sphere S in B intersecting t in two 
points and such that S bounds a (ball, knotted arc) pair. 

(2) A tangle {B,t) is said to be rational if it is homeomorphic to the tangle {D^ x I,{x,y\ x /). This 
is equivalent to the existence of a properly embedded disk D' in B such that D' separates the arcs, 
and such that no 2-sphere intersects the arcs in the manner described in the previous paragraph. 

(3) A tangle {B,t) is said to be prime if it is neither locally knotted nor rational. 

Tangles are best studied together with their two-fold covers, branched over t. Notice that the contractibil- 
ity of B ensures that these are uniquely defined once the branching set is specified, so we can speak of the 
two-fold cover, branched over t, without ambiguity. When the branching set downstairs is implicit in the 
discussion, we will abuse terminology and will refer to the cover simply as the double branclied cover of 
a tangle. The following theorem of Lickorish [34] is fundamental: 

Theorem 3.5. [34] Let {B,t) be a tangle, and {B,t) its double branched cover. Then, the following holds: 

(1) {B,t) is locally knotted-^ {B,t) is a non-prime manifold with torus boundary. 

(2) {B,t) is rational <^ {B,t) is a solid torus. 

(3) {B, t) is prime 4^ {B, t) is irreducible, d-irreducible with torus boundary. (Recall that the solid torus 
is the only irreducible, d-reducible manifold with torus boundary.) 

Rational tangles are classified up to isomorphism by the extended rational numbers Q U {oo}, in the 
following sense: we define three types of homeomorphisms of tangles 

(1) h : (B, t, V'l) ^ {B' , t' , -02) is defined as a positive Dehn half-twist about the meridian disk D^, which 
is the intersection of the plane y = with the standard unit 3-ball. 

(2) V : {B,t, tpi) — >■ {B',t',ip2) is defined as a positive Dehn half-twist about the latitude disk D\, which 
is the intersection of the plane x = with the standard unit 3-ball. 

(3) r : [B, t, V'l) ^ [B' , t' , V'2) is defined as a reflection in the plane containing NW, SE and the origin. 

Schematically, these are represented in Figure 6. 

Note that the twisting convention is that of Gordon [23]. It is opposite to that of [18]. 

We also define two special types of tangle, T(0) and r(oo), displayed in Figure 7: 
With all of this in mind, we have the following theorem of Conway [11]: 

Tiieorem 3.6. We have the following facts about rational tangles: 

(1) A tangle (B,t,ip) is rational if and only if it can be obtained from T(oo) by a finite sequence of 
horizontal twist homeomorphisms alternating with reflections taking the form h"'^rh"'^r...h'^"r, where 
the Gi e Z. Setting p/q = ai -\ "*" 1 , with {p,q) = 1, we write {B,t,ip) = T{p/q). 

(2) T{pi/qi) is isomorphic to T(p2/'?2) <^Pi/qi = P2/'72 
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Q-ax 



ft— horizontal half twist 



a 



V— vertical half twist 



r— reflection in the NW-SE plane 



Figure 6. The twist homcomorphisms on rational tangles. Adapted from [23]. 





Figure 7. The tangles T(0) (left) and r(oo) (right). Adapted from [18]. 

A rational tangle of the form T{m/l) is called integral. 

One can define two important operations on tangles. Given tangles A and B, we can form their tangle 
sum by connecting the NE of A with the NW of B via a trivial arc. Similarly, one connects the SE of A 
with the SW of B. the result is (often) a tangle, which we denote hy A + B. Also, given a tangle A, one 
can form its numerator closure by connecting the northern arcs via a trivial arc, and the southern arcs 
in the same fashion. The result, N{A), is a topological pair {S^,K), where K is some knot or link. These 
constructions are schematised in Figure 7. 





Figure 8. A + B (left) and N{A) (right). 

Given rational tangles T{ri), .., T{rn), we define the Montesinos tangle M(ri, .., r„) as T{ri) + ..+T{rn). 
It is shown in [35] that the double branched cover M{ri, ..,rn) is a Seifert fibered space over the disk with 
exceptional fibers of multiplicities l/ri, .., l/r„. 

We can now formulate the tangle model. We model the synaptosome as a ball, which represents the 
region occupied by the 4 protein molecules, with a pair of properly embedded arcs, each arc representing 
a double stranded segment of DNA. These form a tangle which we denote by P. The complement of this 
tangle in S^, together with the DNA outside the synaptosome, form a tangle which we denote by O. The 
entire DNA segment can then be represented as the numerator closure N{0 + P). We model the action 
of the recombinase as replacing the tangle P by another tangle R. Therefore, one round of site-specific 
recombination can be described by the pair of tangle equations 



N{0 + P)=Kq 
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N{0 + R) = Ki 
for some knots Kq, Ki. It is straightforward to extend these to m rounds of processive recombination. 
At the ith recombination step (i > 1), we think of recombination as replacing the tangle {i — 1)R with iR, 
where mR is understood to be i? + ... + i?. Hence, we have 

in times 

N{0 + P)=Ko 
N{0 + R) = Ki 
N{0 + 2R) = K2 

N{0 + mR) = K^ 
The purpose of the model is then to determine the tangles P, R and O, given the knot type of the Ki. 
Usually Kf) — Oi, that is, the initial DNA is taken to be unknotted. 

Following the discussion in [40] , we remark that experimental evidence seems to indicate that Hin behaves 
in a more complex way, that of a 3-string tangle (see Figure 2). These objects are still not well understood, 
although there has been recent progress - e.g. see the work of Cabrera-Ibarra [8-10] and Darcy et al [16]. 
Nevertheless, there exists biological evidence pointing to the fact that the enhancer sequence does not play 
an active role in the recombination process [29]. Therefore, it is possible to perform an isotopy that removes 
it from the region modelling the invertasome, justifying the treatment of P and R as 2-string tangles. 

Solving tangle equations in full generality is a very hard problem. Fortunately, the knots usually observed 
in DNA recombination experiments are 4-plats (also known as 2-bridge knots), and this makes the problem 
more tractable. Given enough reaction steps, it is often possible to uniquely determine O and R. 

The strategy for solving tangle equations is the following: one uses the information on knot types to argue 
that P and R are rational tangles. This means that their double branched covers are solid tori. Therefore, 
we convert the tangle analysis into a Dehn surgery problem. That is, we can think of attaching a solid torus 
T~^{P) (resp. R) to O. Here, t denotes the double branched cover map. Then, results on Dehn surgery 
are applied to (hopefully) conclude that O must be a solid torus, implying that O is a rational tangle. A 
computation, as described in [18], then yields the tangle solutions. 

3.1. Tangle equations for Hin action. In the light of the results of Heichman's experiment, we propose 
the following tangle equations for processive Hin recombination on mutated sites 

(1) 7V(0 + P)= 5(1,1) 

(2) N{0 + R) = b{3,l) 

(3) N{0 + R + R)=b{7,3) 

(4) N{0 + R + R + R)^K 

Here, we have switched to the 2-bridge notation of Schubert. Therefore, 6(1, 1) denotes the unknot, 
6(3, 1) denotes the trefoil knot, and 5(7,3) denotes the 82 knot. K is a 7-crossing knot, and its knot will 
be determined as a corollary of our results. The faintness of the line in the agarose gel corresponding to 7 
crossing knots (sec [27]) supports the suggestion that K is the product of 3 rounds of recombination, since 
one should expect a dramatic decrease in the density of products as the number of recombination rounds 
increases. 

As stated in the previous section, we will also be interested in the 5(3, 1)#5(3, 1) case, which we postulate 
arises from distributive recombination. To capture the essence of this phenomenon, we propose the following 
tangle equations 

(5) Ar(Q + P) = 5(3,1) 



(6) iV(Q + i?) = 6(3, 1)#5(3, 1) 

Notice that Q is not the same as O. These are two different systems, which must be treated separately. 
Nevertheless, the information on P and R is crucial for the analysis, and it is part of the motivation for 
considering the first system. 

In the next section, we tackle the equations for distributive recombination in a much more general form, 
namely 

(7) N{Q + P)=Ki 

(8) N{Q + R) = i^2#i^3 

where the K^ are 4-plats, and P, R are rational. We postpone the solution of Equations (l)-(4) until 
Section 4, where we also solve Equations (5) and (6) as a consequence of the work in Section 3. 

4. Solution of tangle equations yielding composite knots 

Our goal in this section is to give results on the classification of tangles O which arise as solutions to the 
tangle equations presented below: 

N{0 + P)=Ki 
N{0 + R) = K2#K3 
where the Ki are 4-plats, and P, R are rational. K2 and K^ are both assumed to be nontrivial. 

Note: O cannot be a rational tangle, since the sum of two rational tangles gives a 4-plat. So we consider 
O to be either locally knotted or prime. 

4.1. Locally knotted solutions. We start by looking for locally knotted solutions. We begin by noting 
that, since (ball, knotted arc) pairs persist upon tangle summation, Ki — K2 or Ki — K3 for locally knotted 
solutions to occur. If that is the case {Ki = K2, say), we can excise the common (ball, knotted arc) pairs in 
both equations and we are thus reduced to 

N{0' + P) =0i 
N(0' + R) = K3 

where Oi is the unknot. O' is a tangle from which a (ball, knotted arc) pair has been excised. Now, we can 
interpret our tangle equations as Dehn surgery on a strongly invertible knot K in S^, the core of the solid 
torus given by t~^{P). Our notation for Dchn surgery on a manifold O with a torus boundary component, 
along a slope a, is 0{a). If K3 = b{p, q) then taking double branched covers gives 

O'(a) == 5^ 

d'iP)^L{p,q) ^ 

where a, /3 are slopes (simple closed curves up to isotopy) in dO' . Note that O' is either rational or prime. 
We would like to conclude that O' is a solid torus (and hence O' is rational), but this is not always true. 
There are numerous examples of nontrivial strongly invertible knots having lens space surgeries. Following a 
procedure explored by Bleiler [1], we can construct, given a strongly invertible projection of a knot possessing 
a lens space surgery, examples of prime tangles having rational tangle attachments yielding 4-plats. The 
reader is referred to Bleiler's work [1] and to Gordon [22] for details on this construction. 

Recent results of Kronheimer et al [33] show the following: if \p\ < 9, then the only non-trivial knot 
which possesses a L{p, q) surgery is the trefoil. We note that site-specific recombination usually produces 
2-bridge knots of low complexity {\p\ < 9). This, together with a complete knowledge of surgeries on torus 
knots as given by Moser [36] and the remarks in the preceding paragraph, can help one enumerate locally 
knotted solutions to the tangle equations (7) and (8) for the case \p\ < 9. Whereas it would be impractical 
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to explicitly formulate a classification theorem, individual cases can be treated with relative ease, and the 
details are left to the reader. 

4.2. Prime tangle solutions. To classify prime tangle solutions we use two different strategies depending 
on the distance between P and R. The distance between two rational tangles P — T{p/q) and R = T{r/s) 
is defined as d{P^ R) — \ps — rq\. The distance between P and R is related to the distance between the Dehn 
filling slopes which they determine as follows: 

Lemma 4.1. Let P and R be rational tangles. Consider N{0 + P), N(0 + R), and the corresponding lifts 
0{a), 0{l3). Then, d{P, R) ~ A(a,/3), where A(a,/3) is the minimal geometric intersection number between 
the slopes a, /? G dO. 

Proof. Let us consider the boundary of a tangle T. The SE — SW arc lifts via the branched covering 
map to an essential simple closed curve fci S dT. Likewise, the NE — SE arc lifts to an essential simple 
closed curve fc2- It is clear that these two curves form a basis for Hi{dT). Now, let us fill the boundary 
of T with a rational tangle p/q. Ernst [17] shows that the curve that generates the kernel of the inclusion 
i* : Hi{dT{p/q)) — >• Hi{T{p/q)) is generated by pki + qk2. Thus, for Ti, pki+qk2 bounds a disk, and for T2, 
rki + sk2 bounds a disk. Thus, the distance between the fillings is just the geometric intersection number of 
these two curves, and the result follows. D 

From now on, we will treat the cases d{P, R) > 1 and d(P, R) = 1 separately. 

4.2.1. Case d(P, R) > 1; To begin, we calculate the double branched cover of a connected sum of 4-plats. 

Proposition 4.2. Given b{p,q)jj=b{r,s), then it is double branch covered by L(p,q)=ffL{r, s). 

Proof. Let L{p,q) be the double branched cover of b{p,q). Remove a ball from S^ intersecting Ki in an 
unknotted arc. The resulting operation upstairs is to puncture Ki, since a ball double branch covers itself 
branched along an unknotted arc. Thus, we have that the double branched cover of a "ball with knotted Ki 
arc" is a punctured Ki. The same goes for K2. Gluing two of these balls along their boundary gives the 
pair {S^,Ki^K2), and the corresponding operation upstairs is to glue the punctured branched covers along 
their S*^ boundary. The result follows. D 

Translating our problem to the double branched cover setting, we have the following Dehn Filling problem: 

(9) d{a)=L{p,q) 

(10) diP)^L{r,s)ifL(t,v) 

where O is irreducible, 9- irreducible, and d{a,/3) > 1. We now apply results on exceptional Dehn fillings 
at maximal distance. In particular, the following result of Boyer and Zhang [2] is relevant: 

Theorem 4.3. ([2], Theorem 1.2.1) Let M be a compact, connected, orientable, irreducible 3-manifold with 
DM a torus. Assume that M is neither a simple Seifert fibered manifold nor a cable on I(K) (the twisted 
I-bundle over the Klein bottle). Fix slopes ri and r2 on DM and .suppose that M{ri) is a reducible manifold. 
If M{r2) has a cyclic fundamental group, then d{ri,r2) < 1. 

In order to apply this result to Equations (9) and (10), we must check that O satisfies the additional 
Seifert fibration and twisted /-bundle conditions. We say that a manifold is simple if it has no spheres, 
disks, tori or annuli that are essential. We will proceed as follows: recall that a non-simple annular manifold 
with torus boundary is cither a small Seifert fibered space or toroidal. We will show that O cannot be a 
Seifert fibered space (SFS). Then, we will show that O is atoroidal, which implies that it must be simple. 
This will show that O is simple, which will allow us to use Theorem 4.3. 

Proposition 4.4. Let M be a SFS such that dM is a torus, and suppose that, for some slopes a, j3, 
M{a) ^ L{p,q) and M{(i) = L{r,s)ifL{t,u), with r,t ^ I. Then d{a,l3) = 1. 

Proof. Wc will consider two cases: 



(1) r y^ 2 or t y^ 2: Suppose that M admits a Seifert fibration. M{/3) is the result of performing 
Dehn fiUing on M along a slope (3 in dM. This filling always extends the fibration unless /3 is 
isotopic to a regular fiber (see [4]). Now we must distinguish two subcases: if both r, t = 0, then 
M{(3) — S^ X S^ifS^ X S^. This clearly does not admit a Seifert fibration. This implies that (3 is 
the unique slope that destroys the fibration, and so M{a) extends such a fibration. This implies 
(see [17]) that the orbit surface of M{a) is either a torus or a nonorientable surface of genus two. 
This is impossible, since lens spaces can only fiber over 5*^ or RP"^ (here we are taking L(0, 1) as a 
lens space). If at least one of r,t is not equal to zero, then M cither fibers over the disk with two 
exceptional fibers (M cannot be a solid torus) or over MP^ with one exceptional fiber. In either 
case, attaching a solid torus as determined by a must add a regular fiber in order for us to obtain a 
lens space. Notice that once again /3 destroys the fibration, so it must be isotopic to a fiber H on 
dM. Choosing a crossing curve Q for dM and denoting a regular fiber by H, we must have that 
a ^ Q + vH, and /3 ^ H, whence d{a, (3) = 1. 

(2) r = t = 2. Here L(2, 1)#L(2, 1) = RP^^^RP^ can indeed be given a Seifert fibered structure. It is 
the orientable circle bundle over RP^, and thus it fibers over RP"^ with no exceptional fibers (see 
[4]). Therefore two situations may occur: either M fibers over the disk with two exceptional fibers 
and we have obtained L(2, 1)^L{2, 1) by gluing a solid torus V such that a meridian of V is attached 
to a fiber H on dM, or M is the orientable circle bundle over the Mobius band. If the former holds, 
then we proceed as in case (1) to conclude that d{a, /3) ~ I. If the latter holds, notice that there 
is an alternative fibering over the disk with two exceptional fibers, and hence this reduces to the 
previous situation, so we are finished. 

D 

Now, we show that O is atoroidal. First, note that O cannot contain non-separating tori, for these would 
remain non-separating after Dehn filling, contradicting the fact that all tori in lens spaces are separating. 
Next, we appeal to the following result, which can be found in [26]. This is a preliminary lemma in the proof 
of the JSJ decomposition Theorem. The result, we believe, is due to Kneser. 

Theorem 4.5. Let M be irreducible, compact and connected. Then, there exists a finite disjoint collection 
of tori T such that each component of M\T is atoroidal. 

Since all tori in O are separating, the operation described above breaks O into a disconnected collection 
of atoroidal manifolds. These are clearly, irreducible, 9-irreducible. Since all boundary components are tori, 
each piece is either a small SFS or simple, as remarked before Propoposition 4.4. 

In order to aid our discussion, let us define the following: 

Definition 4.6. A splitting graph of O along T is a graph Q whose edges are the essential tori in T, and 
the vertices are connected pieces of the decomposition. Two vertices are connected by an edge if and only if 
some incompressible torus in T separates their corresponding components, and T lies in both components. 

Lemma 4.7. The splitting graph for O is a tree. 

Proof. Recall that a finite connected graph is a tree if and only if removing any edge disconnects it. That Q 
is connected follows from the fact that O is connected. Since all of our tori are separating, deleting one of 
them will disconnect O. This implies that the splitting graph for this disconnected manifold is a disconnected 
graph, and the result follows. D 

Now, we let the vertex corresponding to the connected component that contains dO be the root of the 
splitting tree, and we can begin the analysis. Recall that in a tree, the concept of height of edges and 
vertices is well defined, once a root is chosen. We define the level of a vertex to be the number of edges 
that separate it from the root. We define the level of an edge to be the height of whichever of its vertices 
is closer to the root (where closer is to be understood in terms of levels). Also, note that there is only one 
vertex on the 0*'' level: this is the root. 

We will number vertices as follows: v'- will denote the fc*'* vertex on the j*'* level, where the ordering at 
each level is chosen arbitrarily (we will soon see that this is irrelevant). An edge e^ will be labelled similarly. 
We will abuse notation and identify vertices with pieces of the decomposition and edges with essential tori. 
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Note that if Q consists of a single vertex, there is nothing to prove: in this case, O is atoroidal, and hence 
simple, by previous discussions. So we may assume that the set of edges is nonempty. We can now state our 
first result. 

Proposition 4.8. vq is a SFS over the annulus with at most one exceptional fiber. 

Proof. As remarked above Proposition 4.4, vq is either simple or a small SFS. Let us deal with the simple 
case first. 

Case 1: Suppose vq is simple. We may assume that wq has more than one boundary component: these are 
do and {eolj-Zi", where fco is some positive integer. Now, perform the a and /3 fillings along 90, and denote 
the manifolds obtained by a and /3 filling on fo as wo(a) and uo(/3), respectively. We know that L(p, q) and 
L{r,s)^L{t,v) are atoroidal, and hence all the Cq must compress in L{p,q) and L{r,s)^L{t,v). 

We claim that at least one of the eg must compress in vo{a), and the same for vo{P). Let us prove the 
claim: let I? be a compressing disk for ej in L{p,q). Since ej separates O, int{D) is either contained in 
the component that contains dO or in the other component. If int{D) is in this other component, that 
means that ej compresses in O, contradicting the fact that it is essential. Thus we may assume D is in the 
component which contains dO, but not necessarily in wo(a). This will happen if and only if D intersects 
some of the Cq. So suppose D intersects some of the other Cq- This intersection is a finite collection of loops. 
Let / be an innermost loop. We know that I must then bound a disk E G D, and the boundary of this disk 
intersects, say, e™. Notice that this disk doesn't intersect any other of the Cq. Now, we apply an innermost 
loop argument: if this disk is inessential in e™, we may delete its interior from e™, push dE away from 
e™, and glue back another disk disjoint from e™, thus reducing the number of intersections. Hence, we can 
assume that / is an essential loop in e™ which bounds a disk. This is a compression disk for e™. Repeating 
the argument at the start of the claim, we have that this disk is contained in the component to which dO 
belongs. Since E is taken not to intersect any of the other Cq, we have constructed a compressing disk for 
e™ in fo(a)- Exactly the same argument goes for wo(/3), and the claim is proved. 

Now, we have a pair of fillings a and /? for a simple manifold vq such that A{a, (3) > 1 and such that 
Vq (a) and vo (/?) are 9-reducible. This contradicts results of Wu (Theorem 1 of [42] ) , and so vq cannot be 
simple. 

Case 2: vq is a small SFS. We've already remarked that vq has at leat two boundary components. Going 
through the list of small SFS (see [22] ) , we find that the only possibilities are a SFS over the pair of pants 
with no exceptional fibers, or a SFS over the annulus with at most one exceptional fiber. We must rule out 
the pair of pants. 

Suppose Wo is a SFS over the pair of pants. We are given two different Dchn fillings on uo- A filling with 
slope a "destroys" the fibration if and only if a is attached to a curve homologous to a fiber. Since we have 
two fillings with different slopes, at least one of them will "respect" the Seifert fibration and WLOG we see 
that vo{a) is a SFS over the annulus. We can now apply the same reasoning as in the simple case and deduce 
that at least one of the boundary components will compress in vo{(x). Now, it is known that an irreducible 
manifold with a compressible torus boundary component is a solid torus. Thus, fo(a) is a solid torus, and 
this is clearly a contradiction. This completes the proof. D 

We must remark at this point that the case where vq is a SFS over the annulus with at most one exceptional 
fiber can occur: we can regard a SFS over the annulus with at most one exceptional fiber as the result of 
removing the neighborhood of a (p, q) torus knot from a solid torus V, where this torus knot lies on the 
boundary of a torus concentric to dV. q/p is the invariant of the exceptional fiber. Denote this space by Cp_q 
(this is often known as a cable space). Performing Dehn filling on a boundary component yields a Seifert 
fibered space with at most two exceptional fibers for all slopes except one. Standard facts about Seifert 
fibered spaces can then be used to determine exactly when this fibration is that of a solid torus (that is, 
when no exceptional fibers are added). See [4] for details. 

It might look that we cannot make any further progress with our analysis at this stage. However, we can 
get around the difficulties as follows: we claim that vo{a) and wo(/3) a-i'e both solid tori. Thus, we can think 
of Wo (a) and wo(/3) £^s inducing two Dchn fillings on wi, which we will denote by vi{j), vi{S). If we can show 
that the distance between these fillings is not 1, we can apply Proposition 4.8 inductively to show the tree 



is linear, with a terminal node w„ which is a SFS over the annulus with at most one exceptional fiber and a 
pair of induced fillings on w„ at distance greater than 1. 

Lemma 4.9. uo(a) and wo(/3) are solid tori. 

Proof. We know that at least one of the fillings will extend the Seifert fibration, so, WLOG, assume that 
Wo (a) is a SFS. In particular, it is irreducible. Exactly the same argument as for the pair of pants implies 
that vo{a) is 9-reducible, and hence a solid torus. 

Now, if Vo{l3) also respects the fibration, there is nothing to prove. So assume that the fibration does not 
extend to wo(/3). Choose a pair of curves H, Q on the component T of Ovq where the filling takes place. Here, 
_ff is a fiber, and Q is a crossing curve. These form a basis for Hi{T) (see [4] for details on crossing curves). 
Thus, in the a filling, a meridian gets glued to a curve ^Q + vH . In the j3 filling, a meridian gets glued to 
H. However, we showed before that wo(a) is a solid torus, and this cannot fiber with two exceptional fibers. 
This, together with the fact that vq has one exceptional fiber, implies that the a filling adds a regular fiber, 
and hence fi = 1. We thus have that d{a, /3) = |1 — 0-i^| = l. This contradicts d{a, /?)>!, and hence wo(/3) 
must be a solid torus as well. Thus, the claim is proved. D 

We must now compute the distance between these fillings wi(7), Vi{5). Essentially, this is figuring out 
which slopes on dv(){a), dv(){f3) bound disks. We use the following notation: let J be a simple closed curve 
in a solid torus V which does not lie inside a ball, and let Y = V — N{J), where N(J) is a regular open 
neighborhood of J. We denote by (J; r) the result of performing r = m/n Dehn surgery on Y on the 
boundary component created by removing N(J). 

The following result of Gordon is useful: 

Proposition 4.10. (^[24], Lemma 3.3) Let {J',r) and V be as above. Furthermore, let ^, \ be a merid- 
ian/longitude basis for dV . Then, the kernel of the inclusion i^ : Hi{d{J;r)) — > Hi{{J;r)) is generated 
by: 

{ nui^ y^ _i m CJ ^ 

Here, lo is the winding number of J in V. We can now use this to prove: 

Proposition 4.11. A(7, 5) > 2. 

Proof. Let a = m/n, (3 = r/s, with a, (3 as before. Then the previous proposition allows us to express 7 
and 5 in terms of coordinates on dV . We obtain that a meridian in ^0(0:) is a m/nuP' curve. Similarly, a 
meridian in wo(/3) is a r/sw^ curve. Hence, A(7, (5) — \ijj^{ms — rn)\ =cj^A(a,/3). Since in the case of cabling 
spaces cj 7^ 0, the result follows. D 

This allows us to complete our main argument: 

Theorem 4.12. O is atoroidal. 

Proof. We've shown that Wo is a SFS over the annulus, and that VQ{a) and Vq{P) are solid tori. We can 
now pass the analysis to vi , and so we have two Dehn fillings vi (7) and vi {5) . By the previous proposition, 
^(7j <5) > 2. We can now apply the argument of Proposition 4.8 and conclude that vx is a SFS over the 
annulus with at most one exceptional fiber. Carrying out this procedure inductively we eventually reach the 
terminal node w„ of our tree. We can easily see that w„ cannot be simple, and it must therefore be a SFS 
over the disk with at most two exceptional fibers. But this is the situation of Proposition 4.4: w„ cannot 
be a SFS. We are forced to conclude that the collection of essential tori promised by Theorem 4.5 must be 
empty. That is, O is atoroidal. D 

Corollary 4.13. O is simple. 

Proof. This is an immediate consequence of Proposition 4.4 and Theorem 4.12. D 

And finally, we obtain: 



Corollary 4.14. There are no prime tangle solutions to Equations (7), (8), with d{P,R) > 1. 

Proof. We've shown that O is simple. Hence we can apply distance bounds on exceptional Dehn fillings. 
We have Dehn fillings 0{a) yielding a lens space and 0(/3) yielding a reducible manifold. This implies that 
A(q;,/3) < 1, by Theorem 4.3. The result follows. 

D 

4.2.2. Case d{P,R) = 1. Here, the Dehn filling techniques used above break down, and in fact the bounds 
obtained are best possible, in the sense that there are examples of reducible and cyclic fillings at distance 
one. The class of prime tangles (respectively irreducible, 9-irreducible manifolds) is too large for one to 
hope for a complete solution to the problem. However, we can offer a complete solution for some families 
of tangles. We do this for the Montesinos tangles. This is done by classifying all solutions at distance one 
when O is a Seifert fibred space. 

Theorem 4.15. Suppose that O is a SFS, and that 

(11) d{a)=L{p,q) 

(12) 0(/3)=i(r,s)#L(t,u) 

where (r, s), (t, u) are given pairs of integers. Then, p = st + ru + rtm [m G Z), and q = sk + ry, where 
k, y are any integers such that k{u + tm) — yt = 1. 

Proof. We've already remarked that the case r — t — cannot happen. Suppose that both r and t are not 
equal to zero. Then, by the remarks in Proposition 4.4, O fibers over the disk with exactly two exceptional 
fibers, of type r/s and t/u (see [4]). These spaces are double branch covers of the Montesinos tangles. We 
can exploit this fact to simplify our computation. The Seifert fibred space over the disk with two exceptional 
fibers of type r/s, t/u is a double branched cover of the tangle O — T{^) + T{j). It is a straightforward 
matter to verify that N{0 + T(oo)) is a nontrivial connected sum. This is the only tangle addition that has 
this property, and it corresponds to adding a fibered solid torus such that the meridian gets glued to a fiber 
H in do. On the other hand, the fillings that give lens spaces correspond to the integral tangle additions 
downstairs. Using the facts that r(f ) + T{m) = T^i^^) and iV(f + §) = ^i ad/Xbl' )' ^^^ ^^^ c' and d' 
such that c'd - cd' = 1 (see [18]) we have that N{f + ^^) = JV( ''*+^^+^*"' ), where fc,y are such that 
k{u + tm) — yt = 1. This gives all the possibilities as one varies m. D 

This result can easily be translated into the language of Montesinos tangles. 

Corollary 4.16. With p = r = t, Equations (11), (12) are impossible if O is a Seifert fibered space. 

Proof, li p = r = t, then we must have 2p + p'^m = p, whence pm = —1. Since we are assuming that |p| > 1, 
we have a contradiction. D 

5. Solution for processive recombination 

Here, we apply the results of Ernst and Sumners [18] to equations 1-4, corresponding to processive 
recombination of Hin on mutated sites. Although these results have not been published, they are known to 
the community, and we display them here to illustrate the tangle model. 

Note: We will not solve here the tangle equations for Hin acting processively on standard sites. Biologi- 
cally, this is completely analogous to the action of the protein Gin, characterised by Vazquez and Sumners 
[40]. It yielded the solution O ^ T(-l/2), R = (1). 

As discussed in Section 3, we show that the tangles involved must be rational. A straightforward obser- 
vation based on the unique factorisation of knots implies that O, P and R cannot be locally knotted. Thus 
we are left to rule out prime tangles. 

The proof that R must be rational is identical to Lemma 3.1 in [18]. In the following, we show that O 
must be rational, as well. 
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Proposition 5.1. O is rational. 

Proof. We know O is not locally knotted, so suppose O is prime. Then, it can be shown that P is rational 
([18], Lemma 3.1). This implies that O is the complement of a non-trivial knot S^ (see [1]). Suppose that it 
is the complement of a non-torus knot, say O = M^. Equations (2) and (3) imply, in the double branched 
cover setting, O^gVi = L(3, 1) and O U;i V2 = L(7, 3), i.e. we have two Dehn surgeries on K yielding 
lens spaces. By the Cyclic Surgery Theorem (see [14]), they must be integral. Hence, 0{m) = L(3, 1) and 
0{n) = L(7, 3), for some integers, m,n. An easy homology argument implies that m = 3,n — 7. But this 
contradicts the fact that the integers must be successive. Hence we may assume that K is a, (r, s) non trivial 
torus knot. Moser [36] shows that, for p/q surgery on a (r, s) torus knot, we obtain a lens space if and 
only if \rsq + p| = 1, and the resulting lens space is L{p,qs^). Now, r,s are positive integers, and we have 
\p\ ~ rs\q\ ± 1. Since we have L(3, 1) as a result of surgery, we wish to solve |3| — rs\q\ ± 1. But for a 
torus knot to be non-trivial, we must have r > 3, s > 2, and hence the above equation cannot have integer 
solutions. We conclude that K must be the unknot. The result follows. D 

We now calculate the tangle solutions. In the following discussion, we warn the reader that wc will not 
take chirality into account, and so we will take liberties such as to consider 6(3, 1) ~ 6(3, —1). 

Theorem 5.2. Given Equations (l)-(4), then either O = T(l/2) and R = T(-2), or O ^ T{-l/2) and 
i? = T(2). 

Proof. This boils down to a computation ([18], Lemma 2.1). We first check that R must be integral. Suppose 
not. Then, the double branched cover of N{0 + R + R + R) is a Seifert fibered space over S"^ with at least 3 
exceptional fibers. Now, all prime 7-crossing knots are 4-plats (see the table at the end of [7]), and we know 
that lens spaces cannot fiber with more than two exceptional fibers. Hence we obtain a contradiction, so R 
is integral. 

Let R = T{r). Clearly, R + R^ T{2r). Let O = T{u/v). Applying Lemma 2.1 from [18], we obtain the 
following equations |u -I- fc| =3 and \u + 2fc| — 7, where k — rv. Solving gives the following solution pairs: 
{u,k) = {(1,-4), (—1,4), (2,-5), (—2,5)}. We must now check case by case. 

Suppose r — l,u = —4 (the reverse sign case is essentially the same, but with mirror images). We have, 
upon checking, N{0+R) = 6(3, 1), N{0+R+R) = 6(7, 3), which is consistent. However, N{0+R+R+R) = 
6(11,3) ~ 6(11,4). From tables [7], this is Stevedore's knot, a 6-crossing knot. This is inconsistent, so we 
discard this solution. 

Suppose r — 4:,v = —1. We obtain N{0 + R + R) = 6(7, 8) ^ 6(7, 1), which is inconsistent, so we discard 
this solution. 

Suppose r = 2,v = -2. Wc obtain N{0 + R) ^ 6(3, 1), N{0 + R + R) ^ 6(7, 3), N{0 + R + R + R) ^ 
6(11,6) ^ 6(11,-5) ^ 6(11,5). This is consistent, so we have found a solution (r — — 2,w = 2 is also a 
solution, and this gives mirror images). The other cases are checked similarly, and all the solutions are 
discarded. D 

Corollary 5.3. Under the hypotheses of the tangle model, N[0 + R + R + R) — 6(11, 5). 

Proof. This is a straightforward computation. D 

We've isolated the unique tangle solution, modulo mirror images. We can now decide on the chirality 
by looking again at Figure 3. It is clear that the trefoil obtained is a left-handed one. As only the pair 
O = T(-l/2), R = r(2) yields N{0+R) = 6(3, 1) as the left handed trefoil, we take it as the solution. Notice 
that this determines the handedness of all subsequent products of processive recombination. Projections of 
the solution tangles are displayed in Figure 9. 

Unfortunately, little can be said about the tangle P. In fact, it admits infinitely many solutions (see 
[18]). This is essentially due to the fact that it only appears once in the equations, and this does not give 
enough information to determine its type. Following Vazquez (see [40] and references therein), we adopt the 
common approach and set P — T(0). 

The results obtained confirm the model of Heichman et al [27] that recombination of the mutated sites 
occurs by a 360 degree clockwise twist at the recombinations sites, represented by the tangle T(2). The 
T(— 1/2) solution is consistent with the fact that the enhancer sequence does indeed acquire the predicted 
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Figure 9. Solutions for processive recombination. 

relative position with respect to the crossover sites. This is in agreement with the treatment of Vazquez [40] 
The solutions are schematised in Figure 10. 
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Figure 10. Diagram of an isotopy of the pre-recombination DNA into N{T{-l/2) + T(0)). 

6. Solution for distributive recombination 
We now tackle the equations for distributive recombination. Recall from Section 3 that these were 

7V(Q + P)= 5(3,1) 

7V(Q + i?) = 6(3, 1)#6(3, 1) 

This will characterise, topologically, all the configurations of the outside bound DNA during distributive 
recombination steps. We take P — T(0) and R — T(2), as determined in the previous section. This is essen- 
tially the assumption that the mechanism of action of the protein is independent of whether recombination 
is processive or distributive. This reflects the experimental evidence [29]. 

To prove Theorem 6.2, we need the following result of Ernst and Sumners: 

Lemma 6.1. (Lemma 3.6 [18]j Let A = r(0) and B = T{a/j3), and suppose X is a tangle such that 
N{X + A) — 6(1, 1) and N{X + B) ^ b{p, q). Then. L{p, q) can be obtained by (/3 + so) /a Dehn surgery on 
Mk — X , where s G Z. 

Theorem 6.2. Given that P — T{0), R — T{2), the following are the complete tangle solutions for Q to the 
equations: 

7V(Q + P)= 6(3,1) 
7V(Q + i?) = 6(3, 1)#6(3, 1) 

(1) Rational tangles: There are no rational tangle solutions. 

(2) Prime tangles: There are no prime tangle solutions. 

(3) Locally knotted solutions: The only 4 possible solutions for Q are displayed in Figure 11. 

Proof. We treat each case separately: 

(1) This is a straightforward observation, already remarked in Section 4. 

(2) This follows from Corollary 4.14. 

(3) We use the procedure described at the beginning of Section 4.1 to obtain a one-to-one correspondence 
between locally knotted solutions and solutions to the equations obtained by excising a common knotted 
(ball, arc) pair. 

15 







Figure 11. The four topological possibilities for Q. Note that only A and B are consistent 
with experimental evidence. 



N{Q + R) = b{3,l) 

Here, Q is either rational or prime. Suppose Q is prime. Then, its double branched cover is a knot 
complement Mx- Suppose K is not a torus knot (i.e. Q is not a SFS). By Lemma 6.1, we can obtain 
L(3, 1) by performing (1 + 2s)/2 surgery on K, contradicting the Cyclic Surgery Theorem (which asserts 
that the surgery must be integral). Hence we may assume that X is a torus knot. Applying exactly the 
same argument as in Proposition 5.1, we conclude that K is the unknot, i.e. Q is rational. 

It is now a routine matter to enumerate the possibilities for Q. Let Q = T{u/v). Applying Lemma 2.1 
of [18] to both equations with P = T(0) and R = T{2) we get \u\ = 1 and \u + 2v\ = 3. Solving gives 
the rational numbers u/v = {1/1, —1/2}. Reintroducing the excised trefoils, we obtain the desired locally 
knotted solutions. 

D 

The results suggest that, with the exception of solutions C and D in Figure 11, the action of Hin during 
distributive recombination is essentially identical to that of processive recombination. In particular, this 
seems to indicate the formation of composite knots occurs as Hin traps a DNA molecule which has already 
undergone one round of recombination, with the existing DNA knot being localised outside of the inverta- 
some. The invertasome maintains a configuration that is identical to the processive recombination steps. 
This is consistent with the fact that Hin is a highly restrictive protein, and the invertasome tends to be 
productive only under very special conditions [29]. These results are what one would naturally expect of 
distributive recombination yielding composite knots. 

The undesircd solutions illustrated in Figures IIC and D do not seem to conform to any previously 
reported biological mechanism [29]. In particular, they induce an antiparallel alignment of the inverted 
crossover sites. However, as discussed in Section 2, the DNA segments involved in the invertasome are 
only several dozen bp long. This short length makes it highly unlikely a compensating supercoil could be 
introduced within the invertasome to give the required parallel alignment of the inverted crossover sites. 

7. Conclusions and directions for future research 

We have successfully extended the tangle model to treat equations involving composite knots in most 
cases. To do this, we first built on results on exceptional Dehn fillings at maximal distance. We then showed 
that, for any prime tangle, there are no rational tangle attachments of distance greater than one that first 
yield a 4-plat and then a connected sum of 4-plats. Next, we applied the tangle model to describe the 
rational tangle solution to processive Hin recombination on mutated sites. Finally, this was combined with 
our surgery results to classify all tangle solutions arising from distributive recombination mediated by Hin 
(Theorem 6.2). 

Our work confirms the biologically predicted configuration for the invertasome (modulo one undesircd 
solution, that can be excluded on biological grounds). Further, the outside DNA attains a configuration that 
is the expected one, with localised knotted arcs trapped outside the invertasome, as shown in Figure 11. 

We emphasize that although we have illustrated this extended tangle model with the Hin recombinase, 
that the lens space connect sum surgery restrictions, as developed in Section 4, are completely general. Thus, 
these can be applied to a variety of site-specific recombination systems where composite knots arise as a 
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result of distributive recombination. In particular, the Gin recombinase has been known to yield composite 
knots [30]. 

7.1. Directions for future research. Experimentally, it would be useful to obtain electron micrograph 
images of reaction products of the form 6(3, 1)^6(7,3), which we predict would occur as the result of an 
additional round of recombination in our model. The existence of these products would eliminate solutions 
C and D in Theorem 6.2, as discussed in the previous section. 

As remarked in Section 3, experiments indicate the Hin invertasome behaves as a 3-string tangle. One 
other possible avenue of research would therefore be to model Hin as a 3-string tangle. Darcy, Luecke and 
Vazquez [16] and Cabrera Ibarra [8], [9], [10] have published some results in this direction. In particular, 
Cabrera Ibarra gives results on the classification of 3-string tangle solutions to processive recombination by 
Gin, a protein very similar to Hin. We plan to extend these results to the setting of distributive recombination. 

Finally, the most natural direction in which to extend our results would be to obtain more information 
on the d{P, R) = 1 case. This could be then be applied to a variety of site-specific recombination systems, 
including distributive recombination of Hin on standard sites [27], and processive followed by distributive 
recombination of Gin on directly repeated sites [38] , [30] . Additionally, classifying such surgeries at distance 
one would be an interesting mathematical problem on its own. We are currently using knot Floer homology 
[33] to address this question. 
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